Suboptimal Local Alignments Across Multiple Scoring Schemes
Authors
Abstract
Sequence alignment algorithms have a long-standing tradition in bioinformatics. In this paper, we formulate an extension to existing local alignment algorithms: local alignments across multiple scoring functions. For this purpose, we use the Waterman-Eggert algorithm for suboptimal local alignments as a template and introduce two new features: 1) an alignment of two strings over a set of score functions and 2) a switch cost function δ that penalizes jumps into a different scoring scheme within an alignment. We studied phylogenetic footprinting, one potential application of this algorithm, in greater detail. In this context, the right evolutionary distance, and thus the right scoring scheme, is often not known a priori. We measured sensitivity and specificity on a test set of 21 human-rodent promoter pairs. Ultimately, we attained a 4.5-fold enrichment of verified binding sites in our alignments.
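The abstract does not give the recurrence, so the following is only a minimal sketch of how a local alignment over a set of score functions with a switch cost δ could be scored. It assumes a Smith-Waterman-style dynamic program with one matrix layer per scoring scheme, a linear gap penalty shared by all schemes, and a constant δ; it returns only the best score and does not reproduce the Waterman-Eggert enumeration of suboptimal alignments. The names `multi_scheme_local_score`, `best_predecessor`, and `only` are hypothetical and not taken from the paper.

```python
def best_predecessor(H, s, i, j, delta):
    """Best score at cell (i, j) if the next column is scored under scheme s.

    Staying in scheme s is free; arriving from any other scheme costs delta.
    """
    return max(layer[i][j] - (0 if t == s else delta) for t, layer in enumerate(H))


def multi_scheme_local_score(a, b, schemes, gap, delta):
    """Best local alignment score of strings a and b over several schemes.

    schemes: list of functions score(x, y) -> number (assumed interface)
    gap:     linear gap penalty, given as a positive number (assumption)
    delta:   constant cost for switching scheme between adjacent columns
    """
    k, n, m = len(schemes), len(a), len(b)
    # H[s][i][j]: best score of a local alignment ending at (i, j)
    # whose last column is scored under scheme s.
    H = [[[0] * (m + 1) for _ in range(n + 1)] for _ in range(k)]
    best = 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            for s, score in enumerate(schemes):
                H[s][i][j] = max(
                    0,  # start a new local alignment here
                    best_predecessor(H, s, i - 1, j - 1, delta)
                    + score(a[i - 1], b[j - 1]),
                    best_predecessor(H, s, i - 1, j, delta) - gap,
                    best_predecessor(H, s, i, j - 1, delta) - gap,
                )
                best = max(best, H[s][i][j])
    return best


def only(base):
    """Toy scheme: reward matches of one base, penalize everything else."""
    return lambda x, y: 3 if x == y == base else -3


# Two toy schemes that each recognize only part of the sequence.
print(multi_scheme_local_score("AACC", "AACC", [only("A"), only("C")], gap=2, delta=1))  # 11
```

In this toy case neither scheme alone scores above 6, while one switch (cost 1) lets the alignment cover all four columns for a score of 11. With a prohibitively large δ the result falls back to the best single-scheme score.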
Similar resources
Local Weighting Schemes for Protein Multiple Sequence Alignment
This paper describes three weighting schemes for improving the accuracy of progressive multiple sequence alignment methods: (1) global profile pre-processing, to capture for each sequence information about other sequences in a profile before the actual multiple alignment takes place; (2) local pre-processing, which incorporates a new protocol to only use non-overlapping local sequence regions t...
Evaluation of scoring functions for protein multiple sequence alignment using structural alignments
The process of aligning a group of protein sequences to obtain a meaningful Multiple Sequence Alignment (MSA) is a basic tool in current bioinformatic research. The development of new MSA algorithms raises the need for an efficient way to evaluate the quality of an alignment, in order to select the best alignment among the ones produced by the available algorithms. A natural way to evaluate the...
Enumerating Suboptimal Alignments of Multiple Biological Sequences Efficiently
The multiple sequence alignment problem is very applicable and important in various fields in molecular biology. Because the optimal alignment that maximizes the score is not always biologically most significant, providing many suboptimal alignments as alternatives for the optimal one is very useful. As for the alignment of two sequences, this suboptimal problem is well-studied [6, 9, 12], but for th...
Enumerating suboptimal alignments of multiple biological sequences efficiently.
The multiple sequence alignment problem is very applicable and important in various fields in molecular biology. Because the optimal alignment that maximizes the score is not always biologically most significant, providing many suboptimal alignments as alternatives for the optimal one is very useful. As for the alignment of two sequences, this suboptimal problem is well-studied, but for the ali...
Aligning Alignments Exactly (Extended abstract)
A basic computational problem that arises in both the construction and local-search phases of the best heuristics for multiple sequence alignment is that of aligning the columns of two multiple alignments. When the scoring function is the sum-of-pairs objective and induced pairwise alignments are evaluated using linear gap-costs, we call this problem Aligning Alignments. While seemingly a strai...